<220>

<221> misc_feature
<222> (1) 

<223> n is any nucleotide 



<400> 1 

ngagctcagg acaggtagaa agaatgaata gaacaataaa agagaccctt actaaattga 60
ccttagagac tggcttaaaa gattggagac gcctcctatc tctggctttg ttaagagcca 120
gaaatacgcc caaccgtttt cggctcaccc catatgaaat cctttatggg ggaccccccc 180
ctttgtcaac cttgctcaat tccttctccc cctccgatcc taagactgat ttacaagccc 240
gactaaaagg gctgcaaggc gtgcaggccc aaatctggac acccctggcc gaattgtacc 300
ggccaggaca tccacaaact agccacccat ttcaggtggg agactccgtg tacgtccggc 360
ggcaccgctc tcaaggattg gagcctcgtt ggaagggacc ttacatcgtc ctgctgacca 420
cgcccaccgc cataaaggtt gacgggatcg ccgcctggat tcacgcatcg cacgccaagg 480
cagccccaaa aacccctgga ccagaaactc ccaaaacctg gaagctccgc cgttcggaga 540
accctcttaa gataagactc tcccgtgtct gactgctaat ccaccttgtc cctgtactaa 600
cccaaaatga aactcccaac aggaatggtc attttatgta gcctaataat agttcgggca 660
gggtttgacg acccccgcaa ggctatcgca ttagtacaaa aacaacatgg taaaccatgc 720
gaatgcagcg gagggcaggt atccgaggcc ccaccgaact ccatccaaca ggtaacttgc 780
ccaggcaaga cggcctactt aatgaccaac caaaaatgga aatgcagagt cactccaaaa 840
atctcaccta gcgggggaga actccagaac tgcccctgta acactttcca ggactcgatg 900
cacagttctt gttatactga ataccggcaa tgcaggcgaa ttaataagac atactacacg 960
gccaccttgc ttaaaatacg gtctgggagc ctcaacgagg tacagatatt acaaaacccc 1020
aatcagctcc tacagtcccc ttgtaggggc tctataaatc agcccgtttg ctggagtgcc 1080
acagccccca tccatatctc cgatggtgga ggacccctcg atactaagag agtgtggaca 1140
gtccaaaaaa ggctagaaca aattcataag gctatgactc ctgaacttca ataccacccc 1200
ttagccctgc ccaaagtcag agatgacctt agccttgatg cacggacttt tgatatcctg 1260
aataccactt ttaggttact ccagatgtcc aattttagcc ttgcccaaga ttgttggctc 1320
tgtttaaaac taggtacccc tacccctctt gcgataccca ctccctcttt aacctactcc 1380
ctagcagact ccctagcgaa tgcctcctgt cagattatac ctcccctctt ggttcaaccg 1440
atgcagttct ccaactcgtc ctgtttatct tcccctttca ttaacgatac ggaacaaata 1500
gacttaggtg cagtcacctt tactaactgc acctctgtag ccaatgtcag tagtccttta 1560
tgtgccctaa acgggtcagt cttcctctgt ggaaataaca tggcatacac ctatttaccc 1620
caaaactgga ccagactttg cgtccaagcc tccctcctcc ccgacattga catcaacccg 1680
ggggatgagc cagtccccat tcctgccatt gatcattata tacatagacc taaacgagct 1740
gtacagttca tccctttact agctggactg ggaatcaccg cagcattcac caccggagct 1800
acaggcctag gtgtctccgt cacccagtat acaaaattat cccatcagtt aatatctgat 1860






gtccaagtct tatccggtac catacaagat ttacaagacc aggtagactc gttagctgaa 1920
gtagttctcc aaaataggag gggactggac ctactaacgg cagaacaagg aggaatttgt 1980
ttagccttac aagaaaaatg ctgtttttat gctaacaagt caggaattgt gagaaacaaa 2040
ataagaaccc tacaagaaga attacaaaaa cgcagggaaa gcctggcaac caaccctctc 2100
tggaccgggc tgcagggctt tcttccgtac ctcctacctc tcctgggacc cctactcacc 2160
ctcctactca tactaaccat tgggccatgc gttttcagtc gcctcatggc cttcattaat 2220
gatagactta atgttgtaca tgccatggtg ctggcccagc aataccaagc actcaaagct 2280
gaggaagaag ctcaggattg agcttccggg acaaaagcag gggggaatga gaagtcagaa 2340
ccccccacct ttgctacata aataaccgct ttcatttcgc ttctgtaaaa cgcttatgcg 2400
ccccacccta gccggaaagt ccccagccgc tacgcaaccc gggccccgag ttgcatcagc 2460
cgttcgcaac ccgggctccg agttgcatca gccgaaagaa acttcatttc ccaagctt 2518

<210> 2
<211> 7616
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Portion of
construct



<400> 2 

aatgaaagac cccacctgta ggtttggcaa gctagcttaa gtaacgccat tttgcaaggc 60
atggaaaaat acataactga gaatagagaa gttcagatca aggtcaggaa cagatggaac 120
agctgaatat gggccaaaca ggatatctgt ggtaagcagt tcctgccccg gctcagggcc 180
aagaacagat ggaacagctg aatatgggcc aaacaggata tctgtggtaa gcagttcctg 240
ccccggctca gggccaagaa cagatggtcc ccagatgcgg tccagccctc agcagtttct 300
agagaaccat cagatgtttc cagggtgccc caaggacctg aaatgaccct gtgccttatt 360
tgaactaacc aatcagttcg cttctcgctt ctgttcgcgc gcttctgctc cccgagctca 420
ataaaagagc ccacaacccc tcactcgggg cgccagtcct ccgattgact gagtcgcccg 480
ggtacccgtg tatccaataa accctcttgc agttgcatcc gacttgtggt ctcgctgttc 540
cttgggaggg tctcctctga gtgattgact acccgtcagc gggggtcttt catttggggg 600
ctcgtccggg atcgggagac ccctgcccag ggaccaccga cccaccaccg ggaggtaagc 660
tggaagcttc tgcagcatcg ttctgtgttg tctctgtctg actgtgtttc tgtatttgtc 720
tgagaatatg ggccagactg ttaccactcc cttaagtttg accttaggtc actggaaaga 780
tgtcgagcgg atcgctcaca accagtcggt agatgtcaag aagagacgtt gggttacctt 840
ctgctctgca gaatggccaa cctttaacgt cggatggccg cgagacggca cctttaaccg 900
agacctcatc acccaggtta agatcaaggt cttttcacct ggcccgcatg gacacccaga 960
ccaggtcccc tacatcgtga cctgggaagc cttggctttt gacccccctc cctgggtcaa 1020
gccctttgta caccctaagc ctccgcctcc tcttcctcca tccgccccgt ctctccccct 1080
tgaacctcct cgttcgaccc cgcctcgatc ctccctttat ccagccctca ctccttctct 1140
aggcgccaaa cctaaacctc aagttctttc tgacagtggg gggccgctca tcgacctact 1200
tacagaagac cccccgcctt atagggaccc aagaccaccc ccttccgaca gggacggaaa 1260
tggtggagaa gcgacccctg cgggagaggc accggacccc tccccaatgg catctcgcct 1320
acgtgggaga cgggagcccc ctgtggccga ctccactacc tcgcaggcat tccccctccg 1380
cgcaggagga aacggacagc ttcaatactg gccgttctcc tcttctgacc tttacaactg 1440
gaaaaataat aacccttctt tttctgaaga tccaggtaaa ctgacagctc tgatcgagtc 1500
tgttctcatc acccatcagc ccacctggga cgactgtcag cagctgttgg ggactctgct 1560
gaccggagaa gaaaaacaac gggtgctctt agaggctaga aaggcggtgc ggggcgatga 1620
tgggcgcccc actcaactgc ccaatgaagt cgatgccgct tttcccctcg agcgcccaga 1680
ctgggattac accacccagg caggtaggaa ccacctagtc cactatcgcc agttgctcct 1740
agcgggtctc caaaacgcgg gcagaagccc caccaatttg gccaaggtaa aaggaataac 1800
acaagggccc aatgagtctc cctcggcctt cctagagaga cttaaggaag cctatcgcag 1860
gtacactcct tatgaccctg aggacccagg gcaagaaact aatgtgtcta tgtctttcat 1920
ttggcagtct gccccagaca ttgggagaaa gttagagagg ttagaagatt taaaaaacaa 1980
gacgcttgga gatttggtta gagaggcaga aaagatcttt aataaacgag aaaccccgga 2040
agaaagagag gaacgtatca ggagagaaac agaggaaaaa gaagaacgcc gtaggacaga 2100
ggatgagcag aaagagaaag aaagagatcg taggagacat agagagatga gcaagctatt 2160
ggccactgtc gttagtggac agaaacagga tagacaggga ggagaacgaa ggaggtccca 2220
actcgatcgc gaccagtgtg cctactgcaa agaaaagggg cactgggcta aagattgtcc 2280
caagaaacca cgaggacctc ggggaccaag accccagacc tccctcctga ccctagatga 2340
ctagggaggt cagggtcagg agcccccccc tgaacccagg ataaccctca aagtcggggg 2400
gcaacccgtc accttcctgg tagatactgg ggcccaacac tccgtgctga cccaaaatcc 2460
tggaccccta agtgataagt ctgcctgggt ccaaggggct actggaggaa agcggtatcg 2520
ctggaccacg gatcgcaaag tacatctagc taccggtaag gtcacccact ctttcctcca 2580
tgtaccagac tgtccctatc ctctgttagg aagagatttg ctgactaaac taaaagccca 2640
aatccacttt gagggatcag gagctcaggt tatgggacca atggggcagc ccctgcaagt 2700



gttgacccta aatatagaag atgagcatcg gctacatgag acctcaaaag agccagatgt 2760
ttctctaggg tccacatggc tgtctgattt tcctcaggcc tgggcggaaa ccgggggcat 2820
gggactggca gttcgccaag ctcctctgat catacctctg aaagcaacct ctacccccgt 2880
gtccataaaa caatacccca tgtcacaaga agccagactg gggatcaagc cccacataca 2940
gagactgttg gaccagggaa tactggtacc ctgccagtcc ccctggaaca cgcccctgct 3000
acccgttaag aaaccaggga ctaatgatta taggcctgtc caggatctga gagaagtcaa 3060
caagcgggtg gaagacatcc accccaccgt gcccaaccct tacaacctct tgagcgggct 3120
cccaccgtcc caccagtggt acactgtgct tgatttaaag gatgcctttt tctgcctgag 3180
actccacccc accagtcagc ctctcttcgc ctttgagtgg agagatccag agatgggaat 3240
ctcaggacaa ttgacctgga ccagactccc acagggtttc aaaaacagtc ccaccctgtt 3300
tgatgaggca ctgcacagag acctagcaga cttccggatc cagcacccag acttgatcct 3360
gctacagtac gtggatgact tactgctggc cgccacttct gagctagact gccaacaagg 3420
tactcgggcc ctgttacaaa ccctagggaa cctcgggtat cgggcctcgg ccaagaaagc 3480
ccaaatttgc cagaaacagg tcaagtatct ggggtatctt ctaaaagagg gtcagagatg 3540
gctgactgag gccagaaaag agactgtgat ggggcagcct actccgaaga cccctcgaca 3600
actaagggag ttcctaggga cggcaggctt ctgtcgcctc tggatccctg ggtttgcaga 3660
aatggcagcc cccttgtacc ctctcaccaa aacggggact ctgtttaatt ggggcccaga 3720
ccaacaaaag gcctatcaag aaatcaagca agctcttcta actgccccag ccctggggtt 3780
gccagatttg actaagccct ttgaactctt tgtcgacgag aagcagggct acgccaaagg 3840
tgtcctaacg caaaaactgg gaccttggcg tcggccggtg gcctacctgt ccaaaaagct 3900
agacccagta gcagctgggt ggcccccttg cctacggatg gtagcagcca ttgccgtact 3960
gacaaaggat gcaggcaagc taaccatggg acagccacta gtcattctgg ccccccatgc 4020
agtagaggca ctagtcaaac aaccccccga ccgctggctt tccaacgccc ggatgactca 4080
ctatcaggcc ttgcttttgg acacggaccg ggtccagttc ggaccggtgg tagccctgaa 4140
cccggctacg ctgctcccac tgcctgagga agggctgcaa cacaactgcc ttgatatcct 4200
ggccgaagcc cacggaaccc gacccgacct aacggaccag ccgctcccag acgccgacca 4260
cacctggtac acggatggaa gcagtctctt acaagaggga cagcgtaagg cgggagctgc 4320
ggtgaccacc gagaccgagg taatctgggc taaagccctg ccagccggga catccgctca 4380
gcgggctgaa ctgatagcac tcacccaggc cctaaagatg gcagaaggta agaagctaaa 4440
tgtttatact gatagccgtt atgcttttgc tactgcccat atccatggag aaatatacag 4500
aaggcgtggg ttgctcacat cagaaggcaa agagatcaaa aataaagacg agatcttggc 4560
cctactaaaa gccctctttc tgcccaaaag acttagcata atccattgtc caggacatca 4620
aaagggacac agcgccgagg ctagaggcaa ccggatggct gaccaagcgg cccgaaaggc 4680
agccatcaca gagactccag acacctctac cctcctcata gaaaattcat caccctacac 4740
ctcagaacat tttcattaca cagtgactga tataaaggac ctaaccaagt tgggggccat 4800
ttatgataaa acaaagaagt attgggtcta ccaaggaaaa cctgtgatgc ctgaccagtt 4860
tacttttgaa ttattagact ttcttcatca gctgactcac ctcagcttct caaaaatgaa 4920
ggctctccta gagagaagcc acagtcccta ctacatgctg aaccgggatc gaacactcaa 4980
aaatatcact gagacctgca aagcttgtgc acaagtcaac gccagcaagt ctgccgttaa 5040
acagggaact agggtccgcg ggcatcggcc cggcactcat tgggagatcg atttcaccga 5100
gataaagccc ggattgtatg gctataaata tcttctagtt tttatagata ccttttctgg 5160
ctggatagaa gccttcccaa ccaagaaaga aaccgccaag gtcgtaacca agaagctact 5220
agaggagatc ttccccaggt tcggcatgcc tcaggtattg ggaactgaca atgggcctgc 5280
cttcgtctcc aaggtgagtc agacagtggc cgatctgttg gggattgatt ggaaattaca 5340
ttgtgcatac agaccccaaa gctcaggcca ggtagaaaga atgaatagaa ccatcaagga 5400
gactttaact aaattaacgc ttgcaactgg ctctagagac tgggtgctcc tactcccctt 5460
agccctgtac cgagcccgca acacgccggg cccccatggc ctcaccccat atgagatctt 5520
atatggggca cccccgcccc ttgtaaactt ccctgaccct gacatgacaa gagttactaa 5580
cagcccctct ctccaagctc acttacaggc tctctactta gtccagcacg aagtctggag 5640
acctctggcg gcagcctacc aagaacaact ggaccgaccg gtggtacctc acccttaccg 5700
agtcggcgac acagtgtggg tccgccgaca ccagactaag aacctagaac ctcgctggaa 5760
aggaccttac acagtcctgc tgaccacccc caccgccctc aaagtagacg gcatcgcagc 5820
ttggatacac gccgcccacg tgaaggctgc cgaccccggg ggtggaccat cctctagact 5880
gacatggcgc gttcaacgct ctcaaaaccc cttaaaaata aggttaaccc gcgaggcccc 5940
ctaatcccct taattcttct gatgctcaga ggggtcagta ctgcttcgcc cggctccagt 6000
gcggcccagc cggccaccat gaaaacattt aacatttctc aacaagatct agaattagta 6060
gaagtagcga cagagaagat tacaatgctt tatgaggata ataaacatca tgtgggagcg 6120
gcaattcgta cgaaaacagg agaaatcatt tcggcagtac atattgaagc gtatatagga 6180
cgagtaactg tttgtgcaga agccattgcg attggtagtg cagtttcgaa tggacaaaag 6240
gattttgaca cgattgtagc tgttagacac ccttattctg acgaagtaga tagaagtatt 6300
cgagtggtaa gtccttgtgg tatgtgtagg gagttgattt cagactatgc accagattgt 6360
tttgtgttaa tagaaatgaa tggcaagtta gtcaaaacta cgattgaaga actcattcca 6420
ctcaaatata cccgaaatta aaagttttac caccaagctt atcgattagt ccaatttgtt 6480
aaagacagga tatcagtggt ccaggctcta gttttgactc aacaatatca ccagctgaag 6540
cctatagagt acgagccata gataaaataa aagattttat ttagtctcca gaaaaagggg 6600
ggaatgaaag accccacctg taggtttggc aagctagctt aagtaacgcc attttgcaag 6660
gcatggaaaa atacataact gagaatagag aagttcagat caaggtcagg aacagatgga 6720




acagtcgaga acttgtttat tgcagcttat aatggttaca aataaagcaa tagcatcaca 6780 
aatttcacaa ataaagcatt tttttcactg cattctagtt gtggtttgtc caaactcatc 6840 
aatgtatctt atcatgtctg gatccccagg aagctcctct gtgtcctcat aaaccctaac 6900 
ctcctctact tgagaggaca ttccaatcat aggctgccca tccaccctct gtgtcctcct 6960 
gttaattagg tcacttaaca aaaaggaaat tgggtagggg tttttcacag accgctttct 7020 
aagggtaatt ttaaaatatc tgggaagtcc cttccactgc tgtgttccag aagtgttggt 7080 
aaacagccca caaatgtcaa cagcagaaac atacaagctg tcagctttgc acaagggccc 7140 
aacaccctgc tcatcaagaa gcactgtggt tgctgtgtta gtaatgtgca aaacaggagg 7200 
cacattttcc ccacctgtgt aggttccaaa atatctagtg ttttcatttt tacttggatc 7260 
aggaacccag cactccactg gataagcatt atccttatcc aaaacagcct tgtggtcagt 7320 
gttcatctgc tgactgtcaa ctgtagcatt ttttggggtt acagtttgag caggatattt 7380 
ggtcctgtag tttgctaaca caccctgcag ctccaaaggt tccccaccaa cagcaaaaaa 7440 
atgaaaattt gacccttgaa tgggttttcc agcaccattt tcatgagttt tttgtgtccc 7500 
tgaatgcaag tttaacatag cagttacccc aataacctca gttttaacag taacagcttc 7560 
ccacatcaaa atatttccac aggttaagtc ctcatttaaa ttaggcaaag gaattc 7616 

<210> 3 
<211> 7308 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Portion of 
construct 



<400> 3 

agatctcccg atcccctatg gtcgactctc agtacaatct gctctgatgc cgcatagtta 60
agccagtatc tgctccctgc ttgtgtgttg gaggtcgctg agtagtgcgc gagcaaaatt 120
taagctacaa caaggcaagg cttgaccgac aattgcatga agaatctgct tagggttagg 180
cgttttgcgc tgcttcgcga tgtacgggcc agatatacgc gttgacattg attattgact 240
agttattaat agtaatcaat tacggggtca ttagttcata gcccatatat ggagttccgc 300
gttacataac ttacggtaaa tggcccgcct ggctgaccgc ccaacgaccc ccgcccattg 360
acgtcaataa tgacgtatgt tcccatagta acgccaatag ggactttcca ttgacgtcaa 420
tgggtggact atttacggta aactgcccac ttggcagtac atcaagtgta tcatatgcca 480
agtacgcccc ctattgacgt caatgacggt aaatggcccg cctggcatta tgcccagtac 540
atgaccttat gggactttcc tacttggcag tacatctacg tattagtcat cgctattacc 600
atggtgatgc ggttttggca gtacatcaat gggcgtggat agcggtttga ctcacgggga 660
tttccaagtc tccaccccat tgacgtcaat gggagtttgt tttggcacca aaatcaacgg 720
gactttccaa aatgtcgtaa caactccgcc ccattgacgc aaatgggcgg taggcgtgta 780
cggtgggagg tctatataag cagagctctc tggctaacta gagaacccac tgcttaactg 840
gcttatcgaa atgtcgactg agaacttcag ggtgagtttg gggacccttg attgttcttt 900
ctttttcgct attgtaaaat tcatgttata tggagggggc aaagttttca gggtgttgtt 960
tagaatggga agatgtccct tgtatcacca tggaccctca tgataatttt gtttctttca 1020
ctttctactc tgttgacaac cattgtctcc tcttattttc ttttcatttt ctgtaacttt 1080
ttcgttaaac tttagcttgc atttgtaacg aatttttaaa ttcacttttg tttatttgtc 1140
agattgtaag tactttctct aatcactttt ttttcaaggc aatcagggta tattatattg 1200
tacttcagca cagttttaga gaacaattgt tataattaaa tgataaggta gaatatttct 1260
gcatataaat tctggctggc gtggaaatat tcttattggt agaaacaact acatcctggt 1320
catcatcctg cctttctctt tatggttaca atgatataca ctgtttgaga tgaggataaa 1380
atactctgag tccaaaccgg gcccctctgc taaccatgtt catgccttct tctttttcct 1440
acagctcctg ggcaacgtgc tggttgttgt gctgtctcat cattttggca agaattggcc 1500
gcaagcttct gcagcatcgt tctgtgttgt ctctgtctga ctgtgtttct gtatttgtct 1560
gagaatatgg gccagactgt taccactccc ttaagtttga ccttaggtca ctggaaagat 1620
gtcgagcgga tcgctcacaa ccagtcggta gatgtcaaga agagacgttg ggttaccttc 1680
tgctctgcag aatggccaac ctttaacgtc ggatggccgc gagacggcac ctttaaccga 1740
gacctcatca cccaggttaa gatcaaggtc ttttcacctg gcccgcatgg acacccagac 1800
caggtcccct acatcgtgac ctgggaagcc ttggcttttg acccccctcc ctgggtcaag 1860
ccctttgtac accctaagcc tccgcctcct cttcctccat ccgccccgtc tctccccctt 1920
gaacctcctc gttcgacccc gcctcgatcc tccctttatc cagccctcac tccttctcta 1980
ggcgccaaac ctaaacctca agttctttct gacagtgggg ggccgctcat cgacctactt 2040
acagaagacc ccccgcctta tagggaccca agaccacccc cttccgacag ggacggaaat 2100
ggtggagaag cgacccctgc gggagaggca ccggacccct ccccaatggc atctcgccta 2160
cgtgggagac gggagccccc tgtggccgac tccactacct cgcaggcatt ccccctccgc 2220
gcaggaggaa acggacagct tcaatactgg ccgttctcct cttctgacct ttacaactgg 2280
aaaaataata acccttcttt ttctgaagat ccaggtaaac tgacagctct gatcgagtct 2340
gttctcatca cccatcagcc cacctgggac gactgtcagc agctgttggg gactctgctg 2400
accggagaag aaaaacaacg ggtgctctta gaggctagaa aggcggtgcg gggcgatgat 2460



gggcgcccca ctcaactgcc caatgaagtc gatgccgctt ttcccctcga gcgcccagac 2520
tgggattaca ccacccaggc aggtaggaac cacctagtcc actatcgcca gttgctccta 2580
gcgggtctcc aaaacgcggg cagaagcccc accaatttgg ccaaggtaaa aggaataaca 2640
caagggccca atgagtctcc ctcggccttc ctagagagac ttaaggaagc ctatcgcagg 2700
tacactcctt atgaccctga ggacccaggg caagaaacta atgtgtctat gtctttcatt 2760
tggcagtctg ccccagacat tgggagaaag ttagagaggt tagaagattt aaaaaacaag 2820
acgcttggag atttggttag agaggcagaa aagatcttta ataaacgaga aaccccggaa 2880
gaaagagagg aacgtatcag gagagaaaca gaggaaaaag aagaacgccg taggacagag 2940
gatgagcaga aagagaaaga aagagatcgt aggagacata gagagatgag caagctattg 3000
gccactgtcg ttagtggaca gaaacaggat agacagggag gagaacgaag gaggtcccaa 3060
ctcgatcgcg accagtgtgc ctactgcaaa gaaaaggggc actgggctaa agattgtccc 3120
aagaaaccac gaggacctcg gggaccaaga ccccagacct ccctcctgac cctagatgac 3180
tagggaggtc agggtcagga gcccccccct gaacccagga taaccctcaa agtcgggggg 3240
caacccgtca ccttcctggt agatactggg gcccaacact ccgtgctgac ccaaaatcct 3300
ggacccctaa gtgataagtc tgcctgggtc caaggggcta ctggaggaaa gcggtatcgc 3360
tggaccacgg atcgcaaagt acatctagct accggtaagg tcacccactc tttcctccat 3420
gtaccagact gtccctatcc tctgttagga agagatttgc tgactaaact aaaagcccaa 3480
atccactttg agggatcagg agctcaggtt atgggaccaa tggggcagcc cctgcaagtg 3540
ttgaccctaa atatagaaga tgagcatcgg ctacatgaga cctcaaaaga gccagatgtt 3600
tctctagggt ccacatggct gtctgatttt cctcaggcct gggcggaaac cgggggcatg 3660
ggactggcag ttcgccaagc tcctctgatc atacctctga aagcaacctc tacccccgtg 3720
tccataaaac aataccccat gtcacaagaa gccagactgg ggatcaagcc ccacatacag 3780
agactgttgg accagggaat actggtaccc tgccagtccc cctggaacac gcccctgcta 3840
cccgttaaga aaccagggac taatgattat aggcctgtcc aggatctgag agaagtcaac 3900
aagcgggtgg aagacatcca ccccaccgtg cccaaccctt acaacctctt gagcgggctc 3960
ccaccgtccc accagtggta cactgtgctt gatttaaagg atgccttttt ctgcctgaga 4020
ctccacccca ccagtcagcc tctcttcgcc tttgagtgga gagatccaga gatgggaatc 4080
tcaggacaat tgacctggac cagactccca cagggtttca aaaacagtcc caccctgttt 4140
gatgaggcac tgcacagaga cctagcagac ttccggatcc agcacccaga cttgatcctg 4200
ctacagtacg tggatgactt actgctggcc gccacttctg agctagactg ccaacaaggt 4260
actcgggccc tgttacaaac cctagggaac ctcgggtatc gggcctcggc caagaaagcc 4320
caaatttgcc agaaacaggt caagtatctg gggtatcttc taaaagaggg tcagagatgg 4380
ctgactgagg ccagaaaaga gactgtgatg gggcagccta ctccgaagac ccctcgacaa 4440
ctaagggagt tcctagggac ggcaggcttc tgtcgcctct ggatccctgg gtttgcagaa 4500
atggcagccc ccttgtaccc tctcaccaaa acggggactc tgtttaattg gggcccagac 4560
caacaaaagg cctatcaaga aatcaagcaa gctcttctaa ctgccccagc cctggggttg 4620
ccagatttga ctaagccctt tgaactcttt gtcgacgaga agcagggcta cgccaaaggt 4680
gtcctaacgc aaaaactggg accttggcgt cggccggtgg cctacctgtc caaaaagcta 4740
gacccagtag cagctgggtg gcccccttgc ctacggatgg tagcagccat tgccgtactg 4800
acaaaggatg caggcaagct aaccatggga cagccactag tcattctggc cccccatgca 4860
gtagaggcac tagtcaaaca accccccgac cgctggcttt ccaacgcccg gatgactcac 4920
tatcaggcct tgcttttgga cacggaccgg gtccagttcg gaccggtggt agccctgaac 4980
ccggctacgc tgctcccact gcctgaggaa gggctgcaac acaactgcct tgatatcctg 5040
gccgaagccc acggaacccg acccgaccta acggaccagc cgctcccaga cgccgaccac 5100
acctggtaca cggatggaag cagtctctta caagagggac agcgtaaggc gggagctgcg 5160
gtgaccaccg agaccgaggt aatctgggct aaagccctgc cagccgggac atccgctcag 5220
cgggctgaac tgatagcact cacccaggcc ctaaagatgg cagaaggtaa gaagctaaat 5280
gtttatactg atagccgtta tgcttttgct actgcccata tccatggaga aatatacaga 5340
aggcgtgggt tgctcacatc agaaggcaaa gagatcaaaa ataaagacga gatcttggcc 5400
ctactaaaag ccctctttct gcccaaaaga cttagcataa tccattgtcc aggacatcaa 5460
aagggacaca gcgccgaggc tagaggcaac cggatggctg accaagcggc ccgaaaggca 5520
gccatcacag agactccaga cacctctacc ctcctcatag aaaattcatc accctacacc 5580
tcagaacatt ttcattacac agtgactgat ataaaggacc taaccaagtt gggggccatt 5640
tatgataaaa caaagaagta ttgggtctac caaggaaaac ctgtgatgcc tgaccagttt 5700
acttttgaat tattagactt tcttcatcag ctgactcacc tcagcttctc aaaaatgaag 5760
gctctcctag agagaagcca cagtccctac tacatgctga accgggatcg aacactcaaa 5820
aatatcactg agacctgcaa agcttgtgca caagtcaacg ccagcaagtc tgccgttaaa 5880
cagggaacta gggtccgcgg gcatcggccc ggcactcatt gggagatcga tttcaccgag 5940
ataaagcccg gattgtatgg ctataaatat cttctagttt ttatagatac cttttctggc 6000
tggatagaag ccttcccaac caagaaagaa accgccaagg tcgtaaccaa gaagctacta 6060
gaggagatct tccccaggtt cggcatgcct caggtattgg gaactgacaa tgggcctgcc 6120
ttcgtctcca aggtgagtca gacagtggcc gatctgttgg ggattgattg gaaattacat 6180
tgtgcataca gaccccaaag ctcaggccag gtagaaagaa tgaatagaac catcaaggag 6240
actttaacta aattaacgct tgcaactggc tctagagact gggtgctcct actcccctta 6300
gccctgtacc gagcccgcaa cacgccgggc ccccatggcc tcaccccata tgagatctta 6360
tatggggcac ccccgcccct tgtaaacttc cctgaccctg acatgacaag agttactaac 6420
agcccctctc tccaagctca cttacaggct ctctacttag tccagcacga agtctggaga 6480






cctctggcgg cagcctacca agaacaactg gaccgaccgg tggtacctca cccttaccga 6540
gtcggcgaca cagtgtgggt ccgccgacac cagactaaga acctagaacc tcgctggaaa 6600
ggaccttaca cagtcctgct gaccaccccc accgccctca aagtagacgg catcgcagct 6660
tggatacacg ccgcccacgt gaaggctgcc gaccccgggg gtggaccatc ctctagactg 6720
acatggcgcg ttcaacgctc tcaaaacccc ttaaaaataa ggttaacccg cgaggccccc 6780
taatcccctt aattcttctg atgctcagag gggtcagtac tgcttcgccc ggctccagtg 6840
cggcccagcc ggccaccatg aaaacattta acatttctca acaagatcta gaattagtag 6900
aagtagcgac agagaagatt acaatgcttt atgaggataa taaacatcat gtgggagcgg 6960
caattcgtac gaaaacagga gaaatcattt cggcagtaca tattgaagcg tatataggac 7020
gagtaactgt ttgtgcagaa gccattgcga ttggtagtgc agtttcgaat ggacaaaagg 7080
attttgacac gattgtagct gttagacacc cttattctga cgaagtagat agaagtattc 7140
gagtggtaag tccttgtggt atgtgtaggg agttgatttc agactatgca ccagattgtt 7200
ttgtgttaat agaaatgaat ggcaagttag tcaaaactac gattgaagaa ctcattccac 7260
tcaaatatac ccgaaattaa aagttttacc accaagctta tcgaattc 7308

<210> 4
<211> 7308
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Portion of
construct



<400> 4 

agatctcccg atcccctatg gtcgactctc agtacaatct gctctgatgc cgcatagtta 60
agccagtatc tgctccctgc ttgtgtgttg gaggtcgctg agtagtgcgc gagcaaaatt 120
taagctacaa caaggcaagg cttgaccgac aattgcatga agaatctgct tagggttagg 180
cgttttgcgc tgcttcgcga tgtacgggcc agatatacgc gttgacattg attattgact 240
agttattaat agtaatcaat tacggggtca ttagttcata gcccatatat ggagttccgc 300
gttacataac ttacggtaaa tggcccgcct ggctgaccgc ccaacgaccc ccgcccattg 360
acgtcaataa tgacgtatgt tcccatagta acgccaatag ggactttcca ttgacgtcaa 420
tgggtggact atttacggta aactgcccac ttggcagtac atcaagtgta tcatatgcca 480
agtacgcccc ctattgacgt caatgacggt aaatggcccg cctggcatta tgcccagtac 540
atgaccttat gggactttcc tacttggcag tacatctacg tattagtcat cgctattacc 600
atggtgatgc ggttttggca gtacatcaat gggcgtggat agcggtttga ctcacgggga 660
tttccaagtc tccaccccat tgacgtcaat gggagtttgt tttggcacca aaatcaacgg 720
gactttccaa aatgtcgtaa caactccgcc ccattgacgc aaatgggcgg taggcgtgta 780
cggtgggagg tctatataag cagagctctc tggctaacta gagaacccac tgcttaactg 840
gcttatcgaa atgtcgactg agaacttcag ggtgagtttg gggacccttg attgttcttt 900
ctttttcgct attgtaaaat tcatgttata tggagggggc aaagttttca gggtgttgtt 960
tagaatggga agatgtccct tgtatcacca tggaccctca tgataatttt gtttctttca 1020
ctttctactc tgttgacaac cattgtctcc tcttattttc ttttcatttt ctgtaacttt 1080
ttcgttaaac tttagcttgc atttgtaacg aatttttaaa ttcacttttg tttatttgtc 1140
agattgtaag tactttctct aatcactttt ttttcaaggc aatcagggta tattatattg 1200
tacttcagca cagttttaga gaacaattgt tataattaaa tgataaggta gaatatttct 1260
gcatataaat tctggctggc gtggaaatat tcttattggt agaaacaact acatcctggt 1320
catcatcctg cctttctctt tatggttaca atgatataca ctgtttgaga tgaggataaa 1380
atactctgag tccaaaccgg gcccctctgc taaccatgtt catgccttct tctttttcct 1440
acagctcctg ggcaacgtgc tggttgttgt gctgtctcat cattttggca agaattggcc 1500
gcaagcttct gcagcatcgt tctgtgttgt ctctgtctga ctgtgtttct gtatttgtct 1560
gagaatatgg gccagactgt taccactccc ttaagtttga ccttaggtca ctggaaagat 1620
gtcgagcgga tcgctcacaa ccagtcggta gatgtcaaga agagacgttg ggttaccttc 1680
tgctctgcag aatggccaac ctttaacgtc ggatggccgc gagacggcac ctttaaccga 1740
gacctcatca cccaggttaa gatcaaggtc ttttcacctg gcccgcatgg acacccagac 1800
caggtcccct acatcgtgac ctgggaagcc ttggcttttg acccccctcc ctgggtcaag 1860
ccctttgtac accctaagcc tccgcctcct cttcctccat ccgccccgtc tctccccctt 1920
gaacctcctc gttcgacccc gcctcgatcc tccctttatc cagccctcac tccttctcta 1980
ggcgccaaac ctaaacctca agttctttct gacagtgggg ggccgctcat cgacctactt 2040
acagaagacc ccccgcctta tagggaccca agaccacccc cttccgacag ggacggaaat 2100
ggtggagaag cgacccctgc gggagaggca ccggacccct ccccaatggc atctcgccta 2160
cgtgggagac gggagccccc tgtggccgac tccactacct cgcaggcatt ccccctccgc 2220
gcaggaggaa acggacagct tcaatactgg ccgttctcct cttctgacct ttacaactgg 2280
aaaaataata acccttcttt ttctgaagat ccaggtaaac tgacagctct gatcgagtct 2340
gttctcatca cccatcagcc cacctgggac gactgtcagc agctgttggg gactctgctg 2400
accggagaag aaaaacaacg ggtgctctta gaggctagaa aggcggtgcg gggcgatgat 2460
gggcgcccca ctcaactgcc caatgaagtc gatgccgctt ttcccctcga gcgcccagac 2520



tgggattaca ccacccaggc aggacgcaac cacctagtcc actatcgcca gttgctccta 2580
gcgggtctcc aaaacgcggg cagaagcccc accaatttgg ccaaggtaaa aggaataaca 2640
caagggccca atgagtctcc ctcggccttc ctagagagac ttaaggaagc ctatcgcagg 2700
tacactcctt atgaccctga ggacccaggg caagaaacta atgtgtctat gtctttcatt 2760
tggcagtctg ccccagacat tgggagaaag ttagagaggt tagaagattt aaaaaacaag 2820
acgcttggag atttggttag agaggcagaa aagatcttta ataaacgaga aaccccggaa 2880
gaaagagagg aacgtatcag gagagaaaca gaggaaaaag aagaacgccg taggacagag 2940
gatgagcaga aagagaaaga aagagatcgt aggagacata gagagatgag caagctattg 3000
gccactgtcg ttagtggaca gaaacaggat agacagggag gagaacgaag gaggtcccaa 3060
ctcgatcgcg accagtgtgc ctactgcaaa gaaaaggggc actgggctaa agattgtccc 3120
aagaaaccac gaggacctcg gggaccaaga ccccagacct ccctcctgac cctagatgac 3180
tagggaggtc agggtcagga gcccccccct gaacccagga taaccctcaa agtcgggggg 3240
caacccgtca ccttcctggt agatactggg gcccaacact ccgtgctgac ccaaaatcct 3300
ggacccctaa gtgataagtc tgcctgggtc caaggggcta ctggaggaaa gcggtatcgc 3360
tggaccacgg atcgcaaagt acatctagct accggtaagg tcacccactc tttcctccat 3420
gtaccagact gtccctatcc tctgttagga agagatttgc tgactaaact aaaagcccaa 3480
atccactttg agggatcagg agctcaggtt atgggaccaa tggggcagcc cctgcaagtg 3540
ttgaccctaa atatagaaga tgagcatcgg ctacatgaga cctcaaaaga gccagatgtt 3600
tctctagggt ccacatggct gtctgatttt cctcaggcct gggcggaaac cgggggcatg 3660
ggactggcag ttcgccaagc tcctctgatc atacctctga aagcaacctc tacccccgtg 3720
tccataaaac aataccccat gtcacaagaa gccagactgg ggatcaagcc ccacatacag 3780
agactgttgg accagggaat actggtaccc tgccagtccc cctggaacac gcccctgcta 3840
cccgttaaga aaccagggac taatgattat aggcctgtcc aggatctgag agaagtcaac 3900
aagcgggtgg aagacatcca ccccaccgtg cccaaccctt acaacctctt gagcgggctc 3960
ccaccgtccc accagtggta cactgtgctt gatttaaagg atgccttttt ctgcctgaga 4020
ctccacccca ccagtcagcc tctcttcgcc tttgagtgga gagatccaga gatgggaatc 4080
tcaggacaat tgacctggac cagactccca cagggtttca aaaacagtcc caccctgttt 4140
gatgaggcac tgcacagaga cctagcagac ttccggatcc agcacccaga cttgatcctg 4200
ctacagtacg tggatgactt actgctggcc gccacttctg agctagactg ccaacaaggt 4260
actcgggccc tgttacaaac cctagggaac ctcgggtatc gggcctcggc caagaaagcc 4320
caaatttgcc agaaacaggt caagtatctg gggtatcttc taaaagaggg tcagagatgg 4380
ctgactgagg ccagaaaaga gactgtgatg gggcagccta ctccgaagac ccctcgacaa 4440
ctaagggagt tcctagggac ggcaggcttc tgtcgcctct ggatccctgg gtttgcagaa 4500
atggcagccc ccttgtaccc tctcaccaaa acggggactc tgtttaattg gggcccagac 4560
caacaaaagg cctatcaaga aatcaagcaa gctcttctaa ctgccccagc cctggggttg 4620
ccagatttga ctaagccctt tgaactcttt gtcgacgaga agcagggcta cgccaaaggt 4680
gtcctaacgc aaaaactggg accttggcgt cggccggtgg cctacctgtc caaaaagcta 4740
gacccagtag cagctgggtg gcccccttgc ctacggatgg tagcagccat tgccgtactg 4800
acaaaggatg caggcaagct aaccatggga cagccactag tcattctggc cccccatgca 4860
gtagaggcac tagtcaaaca accccccgac cgctggcttt ccaacgcccg gatgactcac 4920
tatcaggcct tgcttttgga cacggaccgg gtccagttcg gaccggtggt agccctgaac 4980
ccggctacgc tgctcccact gcctgaggaa gggctgcaac acaactgcct tgatatcctg 5040
gccgaagccc acggaacccg acccgaccta acggaccagc cgctcccaga cgccgaccac 5100
acctggtaca cggatggaag cagtctctta caagagggac agcgtaaggc gggagctgcg 5160
gtgaccaccg agaccgaggt aatctgggct aaagccctgc cagccgggac atccgctcag 5220
cgggctgaac tgatagcact cacccaggcc ctaaagatgg cagaaggtaa gaagctaaat 5280
gtttatactg atagccgtta tgcttttgct actgcccata tccatggaga aatatacaga 5340
aggcgtgggt tgctcacatc agaaggcaaa gagatcaaaa ataaagacga gatcttggcc 5400
ctactaaaag ccctctttct gcccaaaaga cttagcataa tccattgtcc aggacatcaa 5460
aagggacaca gcgccgaggc tagaggcaac cggatggctg accaagcggc ccgaaaggca 5520
gccatcacag agactccaga cacctctacc ctcctcatag aaaattcatc accctacacc 5580
tcagaacatt ttcattacac agtgactgat ataaaggacc taaccaagtt gggggccatt 5640
tatgataaaa caaagaagta ttgggtctac caaggaaaac ctgtgatgcc tgaccagttt 5700
acttttgaat tattagactt tcttcatcag ctgactcacc tcagcttctc aaaaatgaag 5760
gctctcctag agagaagcca cagtccctac tacatgctga accgggatcg aacactcaaa 5820
aatatcactg agacctgcaa agcttgtgca caagtcaacg ccagcaagtc tgccgttaaa 5880
cagggaacta gggtccgcgg gcatcggccc ggcactcatt gggagatcga tttcaccgag 5940
ataaagcccg gattgtatgg ctataaatat cttctagttt ttatagatac cttttctggc 6000
tggatagaag ccttcccaac caagaaagaa accgccaagg tcgtaaccaa gaagctacta 6060
gaggagatct tccccaggtt cggcatgcct caggtattgg gaactgacaa tgggcctgcc 6120
ttcgtctcca aggtgagtca gacagtggcc gatctgttgg ggattgattg gaaattacat 6180
tgtgcataca gaccccaaag ctcaggccag gtagaaagaa tgaatagaac catcaaggag 6240
actttaacta aattaacgct tgcaactggc tctagagact gggtgctcct actcccctta 6300
gccctgtacc gagcccgcaa cacgccgggc ccccatggcc tcaccccata tgagatctta 6360
tatggggcac ccccgcccct tgtaaacttc cctgaccctg acatgacaag agttactaac 6420
agcccctctc tccaagctca cttacaggct ctctacttag tccagcacga agtctggaga 6480
cctctggcgg cagcctacca agaacaactg gaccgaccgg tggtacctca cccttaccga 6540




gtcggcgaca cagtgtgggt ccgccgacac cagactaaga acctagaacc tcgctggaaa 6600 
ggaccttaca cagtcctgct gaccaccccc accgccctca aagtagacgg catcgcagct 6660 
tggatacacg ccgcccacgt gaaggctgcc gaccccgggg gtggaccatc ctctagactg 6720 
acatggcgcg ttcaacgctc tcaaaacccc ttaaaaataa ggttaacccg cgaggccccc 6780 
taatcccctt aattcttctg atgctcagag gggtcagtac tgcttcgccc ggctccagtg 6840 
cggcccagcc ggccaccatg aaaacattta acatttctca acaagatcta gaattagtag 6900 
aagtagcgac agagaagatt acaatgcttt atgaggataa taaacatcat gtgggagcgg 6960 
caattcgtac gaaaacagga gaaatcattt cggcagtaca tattgaagcg tatataggac 7020 
gagtaactgt ttgtgcagaa gccattgcga ttggtagtgc agtttcgaat ggacaaaagg 7080 
attttgacac gattgtagct gttagacacc cttattctga cgaagtagat agaagtattc 7140 
gagtggtaag tccttgtggt atgtgtaggg agttgatttc agactatgca ccagattgtt 7200 
ttgtgttaat agaaatgaat ggcaagttag tcaaaactac gattgaagaa ctcattccac 7260 
tcaaatatac ccgaaattaa aagttttacc accaagctta tcgaattc 7308 

<210> 5 
<211> 6028 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Portion of 
construct 

<220> 

<221> misc_feature 
<222> (3774) 

<223> n is any nucleotide 
<220> 

<221> misc_feature
<222> (3775) 

<223> n is any nucleotide 
<220> 

<221> misc_feature
<222> (3776) 

<223> n is any nucleotide 
<220> 

<221> misc_feature 
<222> (3777) 

<223> n is any nucleotide 
<220> 

<221> misc_feature 
<222> (3962) 

<223> n is any nucleotide 
<220> 

<221> misc_feature 
<222> (3963) 

<223> n is any nucleotide 
<220> 

<221> misc_feature 
<222> (3964) 

<223> n is any nucleotide 
<220> 

<221> misc_feature 
<222> (3965) 

<223> n is any nucleotide 
<400> 5 

catatgcggt gtgaaatacc gcacagatgc gtaaggagaa aataccgcat caggcgccat 60 
tcgccattca ggctgcgcaa ctgttgggaa gggcgatcgg tgcgggcctc ttcgctatta 120 
cgccagctgg cgaaaggggg atgtgctgca aggcgattaa gttgggtaac gccagggttt 180



tcccagtcac gacgttgtaa aacgacggcc agtgaattcc gattagttca atttgttaaa 240
gacaggatct cagtagtcca ggctttagtc ctgactcaac aataccacca gctaaaacca 300
ctagaatacg agccacaata aataaaagat tttatttagt ttccagaaaa aggggggaat 360
gaaagacccc accaaattgc ttagcctgat agccgcagta acgccatttt gcaaggcatg 420
gaaaaatacc aaaccaagaa tagagaagtt cagatcaagg gcgggtacac gaaaacagct 480
aacgttgggc caaacaggat atctgcggtg agcagtttcg gccccggccc ggggccaaga 540
acagatggtc accgcggttc ggccccggcc cggggccaag aacagatggt ccccagatat 600
ggcccaaccc tcagcagttt cttaagaccc atcagatgtt tccaggctcc cccaaggacc 660
tgaaatgacc ctgtgcctta tttgaattaa ccaatcagcc tgcttctcgc ttctgttcgc 720
gcgcttctgc ttcccgagct ctataaaaga gctcacaacc cctcactcgg cgcgccagtc 780
ctccgataga ctgagtcgcc cgggtacccg tgtatccaat aaatcctctt gctgttgcat 840
ccgactcgtg gtctcgctgt tccttgggag ggtctcctca gagtgattga ctacccgtct 900
cgggggtctt tcatttgggg gctcgtccgg gatctggaga cccctgccca gggaccaccg 960
acccaccacc gggaggtaag ctggccaaga tcttatatgg ggcacccccg ccccttgtaa 1020
acttccctga ccctgacatg accagagtta ctaacagccc ctctctccaa gctcacttac 1080
aggctctcta cttagtccag cacgaagttt ggagaccact ggcggcagct taccaagaac 1140
aactggaccg gccggtggtg cctcaccctt accgggtcgg cgacacagtg tgggtccgcc 1200
gacatcaaac caagaaccta gaacctcgct ggaaaggacc ttacacagtc ctgctgacca 1260
cccccaccgc cctcaaagta gacggtatcg cagcttggat acacgcagcc cacgtaaagg 1320
cggccgacac cgagagtgga ccatcctctg gacggacatg gcgcgttcaa cgctctcaaa 1380
accccctcaa gataagatta acccgtggaa gcccttaata gtcatgggag tcctgttagg 1440
agtagggatg gcagagagcc cccatcaggt ctttaatgta acctggagag tcaccaacct 1500
gatgactggg cgtaccgcca atgccacctc cctcctggga actgtacaag atgccttccc 1560
aaaattatat tttgatctat gtgatctggt cggagaggag tgggaccctt cagaccagga 1620
accgtatgtc gggtatggct gcaagtaccc cgcagggaga cagcggaccc ggacttttga 1680
cttttacgtg tgccctgggc ataccgtaaa gtcggggtgt gggggaccag gagagggcta 1740
ctgtggtaaa tgggggtgtg aaaccaccgg acaggcttac tggaagccca catcatcgtg 1800
ggacctaatc tcccttaagc gcggtaacac cccctgggac acgggatgct ctaaagttgc 1860
ctgtggcccc tgctacgacc tctccaaagt atccaattcc ttccaagggg ctactcgagg 1920
gggcagatgc aaccctctag tcctagaatt cactgatgca ggaaaaaagg ctaactggga 1980
cgggcccaaa tcgtggggac tgagactgta ccggacagga acagatccta ttaccatgtt 2040
ctccctgacc cggcaggtcc ttaatgtggg accccgagtc cccatagggc ccaacccagt 2100
attacccgac caaagactcc cttcctcacc aatagagatt gtaccggctc cacagccacc 2160
tagccccctc aataccagtt accccccttc cactaccagt acaccctcaa cctcccctac 2220
aagtccaagt gtcccacagc cacccccagg aactggagat agactactag ctctagtcaa 2280
aggagcctat caggcgctta acctcaccaa tcccgacaag acccaagaat gttggctgtg 2340
cttagtgtcg ggacctcctt attacgaagg agtagcggtc gtgggcactt ataccaatca 2400
ttccaccgct ccggccaact gtacggccac ttcccaacat aagcttaccc tatctgaagt 2460
gacaggacag ggcctatgca tgggggcagt acctaaaact caccaggcct tatgtaacac 2520
cacccaaagc gccggctcag gatcctacta ccttgcagca cccgccggaa caatgtgggc 2580
ttgcagcact ggattgactc cctgcttgtc caccacggtg ctcaatctaa ccacagatta 2640
ttgtgtatta gttgaactct ggcccagagt aatttaccac tcccccgatt atatgtatgg 2700
tcagcttgaa cagcgtacca aatataaaag agagccagta tcattgaccc tggcccttct 2760
actaggagga ttaaccatgg gagggattgc agctggaata gggacgggga ccactgcctt 2820
aattaaaacc cagcagtttg agcagcttca tgccgctatc cagacagacc tcaacgaagt 2880
cgaaaagtca attaccaacc tagaaaagtc actgacctcg ttgtctgaag tagtcctaca 2940
gaaccgcaga ggcctagatt tgctattcct aaaggaggga ggtctctgcg cagccctaaa 3000
agaagaatgt tgtttttatg cagaccacac ggggctagtg agagacagca tggccaaatt 3060
aagagaaagg cttaatcaga gacaaaaact atttgagaca ggccaaggat ggttcgaagg 3120
gctgtttaat agatccccct ggtttaccac cttaatctcc accatcatgg gacctctaat 3180
agtactctta ctgatcttac tctttggacc ttgcattctc aatcgattag ttcaatttgt 3240
taaagacagg atctcagtag tccaggcttt agtcctgact caacaatacc accagctaaa 3300
gcctatagag tacgagccat agggcgccta gtgttgacaa ttaatcatcg gcatagtata 3360
cggcatagta taatacgact cactatagga gggccaccat ggccaagttg accagtgccg 3420
ttccggtgct caccgcgcgc gacgtcgccg gagcggtcga gttctggacc gaccggctcg 3480
ggttctcccg ggacttcgtg gaggacgact tcgccggtgt ggtccgggac gacgtgaccc 3540
tgttcatcag cgcggtccag gaccaggtgg tgccggacaa caccctggcc tgggtgtggg 3600
tgcgcggcct ggacgagctg tacgccgagt ggtcggaggt cgtgtccacg aacttccggg 3660
acgcctccgg gccggccatg accgagatcg gcgagcagcc gtgggggcgg gagttcgccc 3720
tgcgcgaccc ggccggcaac tgcgtgcact tcgtggccga ggagcaggac tgannnncgg 3780
accggtcgac ttgttaactt gtttattgca gcttataatg gttacaaata aagcaatagc 3840
atcacaaatt tcacaaataa agcatttttt tcactgcatt ctagttgtgg tttgtccaaa 3900
ctcatcaatg tatcttatca tgtctggatc cagatctggg cccatgcggc cgcggatcga 3960
tnnnnacatg tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct 4020
ggcgtttttc cataggctcc gcccccctga cgagcatcac aaaaatcgac gctcaagtca 4080
gaggtggcga aacccgacag gactataaag ataccaggcg tttccccctg gaagctccct 4140
cgtgcgctct cctgttccga ccctgccgct taccggatac ctgtccgcct ttctcccttc 4200




gggaagcgtg gcgctttctc aatgctcacg ctgtaggtat ctcagttcgg tgtaggtcgt 4260 
tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct gcgccttatc 4320 
cggtaactat cgtcttgagt ccaacccggt aagacacgac ttatcgccac tggcagcagc 4380 
cactggtaac aggattagca gagcgaggta tgtaggcggt gctacagagt tcttgaagtg 4440 
gtggcctaac tacggctaca ctagaaggac agtatttggt atctgcgctc tgctgaagcc 4500 
agttaccttc ggaaaaagag ttggtagctc ttgatccggc aaacaaacca ccgctggtag 4560 
cggtggtttt tttgtttgca agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga 4620 
tcctttgatc ttttctacgg ggtctgacgc tcagtggaac gaaaactcac gttaagggat 4680 
tttggtcatg agattatcaa aaaggatctt cacctagatc cttttaaatt aaaaatgaag 4740 
ttttaaatca atctaaagta tatatgagta aacttggtct gacagttacc aatgcttaat 4800 
cagtgaggca cctatctcag cgatctgtct atttcgttca tccatagttg cctgactccc 4860 
cgtcgtgtag ataactacga tacgggaggg cttaccatct ggccccagtg ctgcaatgat 4920 
accgcgagac ccacgctcac cggctccaga tttatcagca ataaaccagc cagccggaag 4980 
ggccgagcgc agaagtggtc ctgcaacttt atccgcctcc atccagtcta ttaattgttg 5040 
ccgggaagct agagtaagta gttcgccagt taatagtttg cgcaacgttg ttgccattgc 5100 
tacaggcatc gtggtgtcac gctcgtcgtt tggtatggct tcattcagct ccggttccca 5160 
acgatcaagg cgagttacat gatcccccat gttgtgcaaa aaagcggtta gctccttcgg 5220 
tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta tcactcatgg ttatggcagc 5280 
actgcataat tctcttactg tcatgccatc cgtaagatgc ttttctgtga ctggtgagta 5340 
ctcaaccaag tcattctgag aatagtgtat gcggcgaccg agttgctctt gcccggcgtc 5400 
aatacgggat aataccgcgc cacatagcag aactttaaaa gtgctcatca ttggaaaacg 5460 
ttcttcgggg cgaaaactct caaggatctt accgctgttg agatccagtt cgatgtaacc 5520 
cactcgtgca cccaactgat cttcagcatc ttttactttc accagcgttt ctgggtgagc 5580 
aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg gcgacacgga aatgttgaat 5640 
actcatactc ttcctttttc aatattattg aagcatttat cagggttatt gtctcatgag 5700 
cggatacata tttgaatgta tttagaaaaa taaacaaata ggggttccgc gcacatttcc 5760 
ccgaaaagtg ccacctgacg tctaagaaac cattattatc atgacattaa cctataaaaa 5820 
taggcgtatc acgaggccct ttcgtctcgc gcgtttcggt gatgacggtg aaaacctctg 5880 
acacatgcag ctcccggaga cggtcacagc ttgtctgtaa gcggatgccg ggagcagaca 5940 
agcccgtcag ggcgcgtcag cgggtgttgg cgggtgtcgg ggctggctta actatgcggc 6000 
atcagagcag attgtactga gagtgcac 6028 

<210> 6 
<211> 6061 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Portion of 
construct 

<220> 

<221> misc_feature 
<222> (3807) 

<223> n is any nucleotide 
<220> 

<221> misc_feature
<222> (3808) 

<223> n is any nucleotide 
<220> 

<221> misc_feature 
<222> (3809) 

<223> n is any nucleotide 
<220> 

<221> misc_feature 
<222> (3810) 

<223> n is any nucleotide 
<220> 

<221> misc_feature 
<222> (3995) 

<223> n is any nucleotide 



<220> 

<221> misc_feature 
<222> (3996) 

<223> n is any nucleotide 
<220> 

<221> misc_feature 
<222> (3997) 

<223> n is any nucleotide 
<220> 

<221> misc_feature 
<222> (3998) 

<223> n is any nucleotide 
<400> 6 

catatgcggt gtgaaatacc gcacagatgc gtaaggagaa aataccgcat caggcgccat 60
tcgccattca ggctgcgcaa ctgttgggaa gggcgatcgg tgcgggcctc ttcgctatta 120
cgccagctgg cgaaaggggg atgtgctgca aggcgattaa gttgggtaac gccagggttt 180
tcccagtcac gacgttgtaa aacgacggcc agtgaattcc gattagttca atttgttaaa 240
gacaggatct cagtagtcca ggctttagtc ctgactcaac aataccacca gctaaaacca 300
ctagaatacg agccacaata aataaaagat tttatttagt ttccagaaaa aggggggaat 360
gaaagacccc accaaattgc ttagcctgat agccgcagta acgccatttt gcaaggcatg 420
gaaaaatacc aaaccaagaa tagagaagtt cagatcaagg gcgggtacac gaaaacagct 480
aacgttgggc caaacaggat atctgcggtg agcagtttcg gccccggccc ggggccaaga 540
acagatggtc accgcggttc ggccccggcc cggggccaag aacagatggt ccccagatat 600
ggcccaaccc tcagcagttt cttaagaccc atcagatgtt tccaggctcc cccaaggacc 660
tgaaatgacc ctgtgcctta tttgaattaa ccaatcagcc tgcttctcgc ttctgttcgc 720
gcgcttctgc ttcccgagct ctataaaaga gctcacaacc cctcactcgg cgcgccagtc 780
ctccgataga ctgagtcgcc cgggtacccg tgtatccaat aaatcctctt gctgttgcat 840
ccgactcgtg gtctcgctgt tccttgggag ggtctcctca gagtgattga ctacccgtct 900
cgggggtctt tcatttgggg gctcgtccgg gatctggaga cccctgccca gggaccaccg 960
acccaccacc gggaggtaag ctggccaaga tcttatatgg ggcacccccg ccccttgtaa 1020
acttccctga ccctgacatg acaagagtta ctaacagccc ctctctccaa gctcacttac 1080
aggctctcta cttagtccag cacgaagtct ggagacctct ggcggcagcc taccaagaac 1140
aactggaccg accggtggta cctcaccctt accgagtcgg cgacacagtg tgggtccgcc 1200
gacaccagac taagaaccta gaacctcgct ggaaaggacc ttacacagtc ctgctgacca 1260
cccccaccgc cctcaaagta gacggcatcg cagcttggat acacgccgcc cacgtgaagg 1320
ctgccgaccc cgggggtgga ccatcctcta gactgacatg gcgcgttcaa cgctctcaaa 1380
accccttaaa aataaggtta acccgcgagg ccccctaatc cccttaattc ttctgatgct 1440
cagaggggtc agtactgctt cgcccggctc cagtcctcat caagtctata atatcacctg 1500
ggaggtaacc aatggagatc gggagacggt atgggcaact tctggcaacc accctctgtg 1560
gacctggtgg cctgacctta ccccagattt atgtatgtta gcccaccatg gaccatctta 1620
ttgggggcta gaatatcaat cccctttttc ttctcccccg gggccccctt gttgctcagg 1680
gggcagcagc ccaggctgtt ccagagactg cgaagaacct ttaacctccc tcacccctcg 1740
gtgcaacact gcctggaaca gactcaagct agaccagaca actcataaat caaatgaggg 1800
attttatgtt tgccccgggc cccaccgccc ccgagaatcc aagtcatgtg ggggtccaga 1860
ctccttctac tgtgcctatt ggggctgtga gacaaccggt agagcttact ggaagccctc 1920
ctcatcatgg gatttcatca cagtaaacaa caatctcacc tctgaccagg ctgtccaggt 1980
atgcaaagat aataagtggt gcaacccctt agttattcgg tttacagacg ccgggagacg 2040
ggttacttcc tggaccacag gacattactg gggcttacgt ttgtatgtct ccggacaaga 2100
tccagggctt acatttggga tccgactcag ataccaaaat ctaggacccc gcgtcccaat 2160
agggccaaac cccgttctgg cagaccaaca gccactctcc aagcccaaac ctgttaagtc 2220
gccttcagtc accaaaccac ccagtgggac tcctctctcc cctacccaac ttccaccggc 2280
gggaacggaa aataggctgc taaacttagt agacggagcc taccaagccc tcaacctcac 2340
cagtcctgac aaaacccaag agtgctggtt gtgtctagta gcgggacccc cctactacga 2400
aggggttgcc gtcctgggta cctactccaa ccatacctct gctccagcca actgctccgt 2460
ggcctcccaa cacaagttga ccctgtccga agtgaccgga cagggactct gcataggagc 2520
agttcccaaa acacatcagg ccctatgtaa taccacccag acaagcagtc gagggtccta 2580
ttatctagtt gcccctacag gtaccatgtg ggcttgtagt accgggctta ctccatgcat 2640
ctccaccacc atactgaacc ttaccactga ttattgtgtt cttgtcgaac tctggccaag 2700
agtcacctat cattccccca gctatgttta cggcctgttt gagagatcca accgacacaa 2760
aagagaaccg gtgtcgttaa ccctggccct attattgggt ggactaacca tggggggaat 2820
tgccgctgga ataggaacag ggactactgc tctaatggcc actcagcaat tccagcagct 2880
ccaagccgca gtacaggatg atctcaggga ggttgaaaaa tcaatctcta acctagaaaa 2940
gtctctcact tccctgtctg aagttgtcct acagaatcga aggggcctag acttgttatt 3000
tctaaaagaa ggagggctgt gtgctgctct aaaagaagaa tgttgcttct atgcggacca 3060



cacaggacta gtgagagaca gcatggccaa attgagagag aggcttaatc agagacagaa 3120
actgtttgag tcaactcaag gatggtttga gggactgttt aacagatccc cttggtttac 3180
caccttgata tctaccatta tgggacccct cattgtactc ctaatgattt tgctcttcgg 3240
accctgcatt cttaatcgat tagttcaatt tgttaaagac aggatctcag tagtccaggc 3300
tttagtcctg actcaacaat accaccagct aaagcctata gagtacgagc catagggcgc 3360
ctagtgttga caattaatca tcggcatagt atacggcata gtataatacg actcactata 3420
ggagggccac catggccaag ttgaccagtg ccgttccggt gctcaccgcg cgcgacgtcg 3480
ccggagcggt cgagttctgg accgaccggc tcgggttctc ccgggacttc gtggaggacg 3540
acttcgccgg tgtggtccgg gacgacgtga ccctgttcat cagcgcggtc caggaccagg 3600
tggtgccgga caacaccctg gcctgggtgt gggtgcgcgg cctggacgag ctgtacgccg 3660
agtggtcgga ggtcgtgtcc acgaacttcc gggacgcctc cgggccggcc atgaccgaga 3720
tcggcgagca gccgtggggg cgggagttcg ccctgcgcga cccggccggc aactgcgtgc 3780
acttcgtggc cgaggagcag gactgannnn cggaccggtc gacttgttaa cttgtttatt 3840
gcagcttata atggttacaa ataaagcaat agcatcacaa atttcacaaa taaagcattt 3900
ttttcactgc attctagttg tggtttgtcc aaactcatca atgtatctta tcatgtctgg 3960
atccagatct gggcccatgc ggccgcggat cgatnnnnac atgtgagcaa aaggccagca 4020
aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc tccgcccccc 4080
tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga caggactata 4140
aagataccag gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc 4200
gcttaccgga tacctgtccg cctttctccc ttcgggaagc gtggcgcttt ctcaatgctc 4260
acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga 4320
accccccgtt cagcccgacc gctgcgcctt atccggtaac tatcgtcttg agtccaaccc 4380
ggtaagacac gacttatcgc cactggcagc agccactggt aacaggatta gcagagcgag 4440
gtatgtaggc ggtgctacag agttcttgaa gtggtggcct aactacggct acactagaag 4500
gacagtattt ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa gagttggtag 4560
ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt gcaagcagca 4620
gattacgcgc agaaaaaaag gatctcaaga agatcctttg atcttttcta cggggtctga 4680
cgctcagtgg aacgaaaact cacgttaagg gattttggtc atgagattat caaaaaggat 4740
cttcacctag atccttttaa attaaaaatg aagttttaaa tcaatctaaa gtatatatga 4800
gtaaacttgg tctgacagtt accaatgctt aatcagtgag gcacctatct cagcgatctg 4860
tctatttcgt tcatccatag ttgcctgact ccccgtcgtg tagataacta cgatacggga 4920
gggcttacca tctggcccca gtgctgcaat gataccgcga gacccacgct caccggctcc 4980
agatttatca gcaataaacc agccagccgg aagggccgag cgcagaagtg gtcctgcaac 5040
tttatccgcc tccatccagt ctattaattg ttgccgggaa gctagagtaa gtagttcgcc 5100
agttaatagt ttgcgcaacg ttgttgccat tgctacaggc atcgtggtgt cacgctcgtc 5160
gtttggtatg gcttcattca gctccggttc ccaacgatca aggcgagtta catgatcccc 5220
catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca gaagtaagtt 5280
ggccgcagtg ttatcactca tggttatggc agcactgcat aattctctta ctgtcatgcc 5340
atccgtaaga tgcttttctg tgactggtga gtactcaacc aagtcattct gagaatagtg 5400
tatgcggcga ccgagttgct cttgcccggc gtcaatacgg gataataccg cgccacatag 5460
cagaacttta aaagtgctca tcattggaaa acgttcttcg gggcgaaaac tctcaaggat 5520
cttaccgctg ttgagatcca gttcgatgta acccactcgt gcacccaact gatcttcagc 5580
atcttttact ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa 5640
aaagggaata agggcgacac ggaaatgttg aatactcata ctcttccttt ttcaatatta 5700
ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 5760
aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 5820
aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtct 5880
cgcgcgtttc ggtgatgacg gtgaaaacct ctgacacatg cagctcccgg agacggtcac 5940
agcttgtctg taagcggatg ccgggagcag acaagcccgt cagggcgcgt cagcgggtgt 6000
tggcgggtgt cggggctggc ttaactatgc ggcatcagag cagattgtac tgagagtgca 6060
c 6061



<210> 7 
<211> 6312 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Portion of 
construct 



<220> 

<221> misc_feature
<222> (4058) 

<223> n is any nucleotide 



<220> 

<221> misc_feature
<222> (4059) 

<223> n is any nucleotide 
<220> 

<221> misc_feature 
<222> (4060) 

<223> n is any nucleotide 
<220> 

<221> misc_feature 
<222> (4061) 

<223> n is any nucleotide 
<220> 

<221> misc_feature 
<222> (4246) 

<223> n is any nucleotide 
<220> 

<221> misc_feature 
<222> (4247) 

<223> n is any nucleotide 
<220> 

<221> misc_feature 
<222> (4248) 

<223> n is any nucleotide 
<220> 

<221> misc_feature 
<222> (4249) 

<223> n is any nucleotide 



<400> 7 

catatgcggt gtgaaatacc gcacagatgc gtaaggagaa aataccgcat caggcgccat 60
tcgccattca ggctgcgcaa ctgttgggaa gggcgatcgg tgcgggcctc ttcgctatta 120
cgccagctgg cgaaaggggg atgtgctgca aggcgattaa gttgggtaac gccagggttt 180
tcccagtcac gacgttgtaa aacgacggcc agtgaattcc gattagttca atttgttaaa 240
gacaggatct cagtagtcca ggctttagtc ctgactcaac aataccacca gctaaaacca 300
ctagaatacg agccacaata aataaaagat tttatttagt ttccagaaaa aggggggaat 360
gaaagacccc accaaattgc ttagcctgat agccgcagta acgccatttt gcaaggcatg 420
gaaaaatacc aaaccaagaa tagagaagtt cagatcaagg gcgggtacac gaaaacagct 480
aacgttgggc caaacaggat atctgcggtg agcagtttcg gccccggccc ggggccaaga 540
acagatggtc accgcggttc ggccccggcc cggggccaag aacagatggt ccccagatat 600
ggcccaaccc tcagcagttt cttaagaccc atcagatgtt tccaggctcc cccaaggacc 660
tgaaatgacc ctgtgcctta tttgaattaa ccaatcagcc tgcttctcgc ttctgttcgc 720
gcgcttctgc ttcccgagct ctataaaaga gctcacaacc cctcactcgg cgcgccagtc 780
ctccgataga ctgagtcgcc cgggtacccg tgtatccaat aaatcctctt gctgttgcat 840
ccgactcgtg gtctcgctgt tccttgggag ggtctcctca gagtgattga ctacccgtct 900
cgggggtctt tcatttgggg gctcgtccgg gatctggaga cccctgccca gggaccaccg 960
acccaccacc gggaggtaag ctggccaaga tccctaaggt actcgggtca gacaatggcc 1020
cggcctttgt tgctcaggta agtcagggac tggccactca actggggata aattggaagt 1080
tacattgtgc gtatagaccc cagagctcag gtcaggtaga aagaatgaac agaacaatta 1140
aagagacctt gaccaaatta gccttagaga ccggtggaaa agactgggtg accctccttc 1200
ccttagcgct gcttagggcc aggaataccc ctggccggtt tggtttaact ccttatgaaa 1260
ttctctatgg aggaccaccc cccatacttg agtctggaga aactttgggt cccgatgata 1320
gatttctccc tgtcttattt actcacttaa aggctttaga aattgtaagg acccaaatct 1380
gggaccagat caaagaggtg tataagcctg gtaccgtaac aatccctcac ccgttccagg 1440
tcggggatca agtgcttgtc agacgccatc gacccagcag ccttgagcct cggtggaaag 1500
gcccatacct ggtgttgctg actaccccga ccgcggtaaa agtcgatggt attgctgcct 1560
gggtccatgc ttctcacctc aaacctgcac caccttcggc accagatgag tcctgggagc 1620
tggaaaagac tgatcatcct cttaagctgc gtattcggcg gcggcgggac gagtctgcaa 1680
aataagaacc cccaccagcc catgaccctc acttggcagg tactgtccca aactggagac 1740
gttgtctggg atacaaaggc agtccagccc ccttggactt ggtggcccac acttaaacct 1800
gatgtatgtg ccttggcggc tagtcttgag tcctgggata tcccgggaac cgatgtctcg 1860



tcctctaaac gagtcagacc tccggactca gactatactg ccgcttataa gcaaatcacc 1920
tggggagcca tagggtgcag ctaccctcgg gctaggacta gaatggcaag ctctaccttc 1980
tacgtatgtc cccgggatgg ccggaccctt tcagaagcta gaaggtgcgg ggggctagaa 2040
tccctatact gtaaagaatg ggattgtgag accacgggga ccggttattg gctatctaaa 2100
tcctcaaaag acctcataac tgtaaaatgg gaccaaaata gcgaatggac tcaaaaattt 2160
caacagtgtc accagaccgg ctggtgtaac ccccttaaaa tagatttcac agacaaagga 2220
aaattatcca aggactggat aacgggaaaa acctggggat taagattcta tgtgtctgga 2280
catccaggcg tacagttcac cattcgctta aaaatcacca acatgccagc tgtggcagta 2340
ggtcctgacc tcgtccttgt ggaacaagga cctcctagaa cgtccctcgc tctcccacct 2400
cctcttcccc caagggaagc gccaccgcca tctctccccg actctaactc cacagccctg 2460
gcgactagtg cacaaactcc cacggtgaga aaaacaattg ttaccctaaa cactccgcct 2520
cccaccacag gcgacagact ttttgatctt gtgcaggggg ccttcctaac cttaaatgct 2580
accaacccag gggccactga gtcttgctgg ctttgtttgg ccatgggccc cccttattat 2640
gaagcaatag cctcatcagg agaggtcgcc tactccaccg accttgaccg gtgccgctgg 2700
gggacccaag gaaagctcac cctcactgag gtctcaggac acgggttgtg cataggaaag 2760
gtgcccttta cccatcagca tctctgcaat cagaccctat ccatcaattc ctccggagac 2820
catcagtatc tgctcccctc caaccatagc tggtgggctt gcagcactgg cctcacccct 2880
tgcctctcca cctcagtttt taatcagact agagatttct gtatccaggt ccagctgatt 2940
cctcgcatct attactatcc tgaagaagtt ttgttacagg cctatgacaa ttctcacccc 3000
aggactaaaa gagaggctgt ctcacttacc ctagctgttt tactggggtt gggaatcacg 3060
gcgggaatag gtactggttc aactgcctta attaaaggac ctatagacct ccagcaaggc 3120
ctgacaagcc tccagatcgc catagatgct gacctccggg ccctccaaga ctcagtcagc 3180
aagttagagg actcactgac ttccctgtcc gaggtagtgc tccaaaatag gagaggcctt 3240
gacttgctgt ttctaaaaga aggtggcctc tgtgcggccc taaaggaaga gtgctgtttt 3300
tacatagacc actcaggtgc agtacgggac tccatgaaaa aactcaaaga aaaactggat 3360
aaaagacagt tagagcgcca gaaaagccaa aactggtatg aaggatggtt caataactcc 3420
ccttggttca ctaccctgct atcaaccatc gctgggcccc tattactcct ccttctgttg 3480
ctcatcctcg ggccatgcat catcaatcga ttagttcaat ttgttaaaga caggatctca 3540
gtagtccagg ctttagtcct gactcaacaa taccaccagc taaagcctat agagtacgag 3600
ccatagggcg cctagtgttg acaattaatc atcggcatag tatacggcat agtataatac 3660
gactcactat aggagggcca ccatggccaa gttgaccagt gccgttccgg tgctcaccgc 3720
gcgcgacgtc gccggagcgg tcgagttctg gaccgaccgg ctcgggttct cccgggactt 3780
cgtggaggac gacttcgccg gtgtggtccg ggacgacgtg accctgttca tcagcgcggt 3840
ccaggaccag gtggtgccgg acaacaccct ggcctgggtg tgggtgcgcg gcctggacga 3900
gctgtacgcc gagtggtcgg aggtcgtgtc cacgaacttc cgggacgcct ccgggccggc 3960
catgaccgag atcggcgagc agccgtgggg gcgggagttc gccctgcgcg acccggccgg 4020
caactgcgtg cacttcgtgg ccgaggagca ggactgannn ncggaccggt cgacttgtta 4080
acttgtttat tgcagcttat aatggttaca aataaagcaa tagcatcaca aatttcacaa 4140
ataaagcatt tttttcactg cattctagtt gtggtttgtc caaactcatc aatgtatctt 4200
atcatgtctg gatccagatc tgggcccatg cggccgcgga tcgatnnnna catgtgagca 4260
aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg 4320
ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg 4380
acaggactat aaagatacca ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt 4440
ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc cttcgggaag cgtggcgctt 4500
tctcaatgct cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc caagctgggc 4560
tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt 4620
gagtccaacc cggtaagaca cgacttatcg ccactggcag cagccactgg taacaggatt 4680
agcagagcga ggtatgtagg cggtgctaca gagttcttga agtggtggcc taactacggc 4740
tacactagaa ggacagtatt tggtatctgc gctctgctga agccagttac cttcggaaaa 4800
agagttggta gctcttgatc cggcaaacaa accaccgctg gtagcggtgg tttttttgtt 4860
tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag aagatccttt gatcttttct 4920
acggggtctg acgctcagtg gaacgaaaac tcacgttaag ggattttggt catgagatta 4980
tcaaaaagga tcttcaccta gatcctttta aattaaaaat gaagttttaa atcaatctaa 5040
agtatatatg agtaaacttg gtctgacagt taccaatgct taatcagtga ggcacctatc 5100
tcagcgatct gtctatttcg ttcatccata gttgcctgac tccccgtcgt gtagataact 5160
acgatacggg agggcttacc atctggcccc agtgctgcaa tgataccgcg agacccacgc 5220
tcaccggctc cagatttatc agcaataaac cagccagccg gaagggccga gcgcagaagt 5280
ggtcctgcaa ctttatccgc ctccatccag tctattaatt gttgccggga agctagagta 5340
agtagttcgc cagttaatag tttgcgcaac gttgttgcca ttgctacagg catcgtggtg 5400
tcacgctcgt cgtttggtat ggcttcattc agctccggtt cccaacgatc aaggcgagtt 5460
acatgatccc ccatgttgtg caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc 5520
agaagtaagt tggccgcagt gttatcactc atggttatgg cagcactgca taattctctt 5580
actgtcatgc catccgtaag atgcttttct gtgactggtg agtactcaac caagtcattc 5640
tgagaatagt gtatgcggcg accgagttgc tcttgcccgg cgtcaatacg ggataatacc 5700
gcgccacata gcagaacttt aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa 5760
ctctcaagga tcttaccgct gttgagatcc agttcgatgt aacccactcg tgcacccaac 5820
tgatcttcag catcttttac tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa 5880



aatgccgcaa aaaagggaat aagggcgaca cggaaatgtt gaatactcat actcttcctt 5940
tttcaatatt attgaagcat ttatcagggt tattgtctca tgagcggata catatttgaa 6000
tgtatttaga aaaataaaca aataggggtt ccgcgcacat ttccccgaaa agtgccacct 6060
gacgtctaag aaaccattat tatcatgaca ttaacctata aaaataggcg tatcacgagg 6120
ccctttcgtc tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg 6180
gagacggtca cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg 6240
tcagcgggtg ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta 6300
ctgagagtgc 6312



<210> 8 
<211> 5865 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Portion of 
construct 



<220> 

<221> misc_feature
<222> (3611) 

<223> n is any nucleotide 
<220> 

<221> misc_feature 
<222> (3612) 

<223> n is any nucleotide 
<220> 

<221> misc_feature 
<222> (3613) 

<223> n is any nucleotide 
<220> 

<221> misc_feature 
<222> (3614) 

<223> n is any nucleotide 
<220> 

<221> misc_feature 
<222> (3799) 

<223> n is any nucleotide 
<220> 

<221> misc_feature 
<222> (3800) 

<223> n is any nucleotide 
<220> 

<221> misc_feature 
<222> (3801) 

<223> n is any nucleotide 
<220> 

<221> misc_feature 
<222> (3802) 

<223> n is any nucleotide 



<400> 8 

catatgcggt gtgaaatacc gcacagatgc gtaaggagaa aataccgcat caggcgccat 60
tcgccattca ggctgcgcaa ctgttgggaa gggcgatcgg tgcgggcctc ttcgctatta 120
cgccagctgg cgaaaggggg atgtgctgca aggcgattaa gttgggtaac gccagggttt 180
tcccagtcac gacgttgtaa aacgacggcc agtgaattcc gattagttca atttgttaaa 240
gacaggatct cagtagtcca ggctttagtc ctgactcaac aataccacca gctaaaacca 300
ctagaatacg agccacaata aataaaagat tttatttagt ttccagaaaa aggggggaat 360
gaaagacccc accaaattgc ttagcctgat agccgcagta acgccatttt gcaaggcatg 420
gaaaaatacc aaaccaagaa tagagaagtt cagatcaagg gcgggtacac gaaaacagct 480



aacgttgggc caaacaggat atctgcggtg agcagtttcg gccccggccc ggggccaaga 540
acagatggtc accgcggttc ggccccggcc cggggccaag aacagatggt ccccagatat 600
ggcccaaccc tcagcagttt cttaagaccc atcagatgtt tccaggctcc cccaaggacc 660
tgaaatgacc ctgtgcctta tttgaattaa ccaatcagcc tgcttctcgc ttctgttcgc 720
gcgcttctgc ttcccgagct ctataaaaga gctcacaacc cctcactcgg cgcgccagtc 780
ctccgataga ctgagtcgcc cgggtacccg tgtatccaat aaatcctctt gctgttgcat 840
ccgactcgtg gtctcgctgt tccttgggag ggtctcctca gagtgattga ctacccgtct 900
cgggggtctt tcatttgggg gctcgtccgg gatctggaga cccctgccca gggaccaccg 960
acccaccacc gggaggtaag ctggccaaga tcccccgggc tgcaggaatt tatgaaatcc 1020
tttatggggg acccccccct ttgtcaacct tgctcaattc cttctccccc tccgatccta 1080
agactgattt acaagcccga ctaaaagggc tgcaaggcgt gcaggcccaa atctggacac 1140
ccctggccga attgtaccgg ccaggacatc cacaaactag ccacccattt caggtgggag 1200
actccgtgta cgtccggcgg caccgctctc aaggattgga gcctcgttgg aagggacctt 1260
acatcgtcct gctgaccacg cccaccgcca taaaggttga cgggatcgcc gcctggattc 1320
acgcatcgca cgccaaggca gccccaaaaa cccctggacc agaaactccc aaaacctgga 1380
agctccgccg ttcggagaac cctcttaaga taagactctc ccgtgtctga ctgctaatcc 1440
accttgtccc tgtactaacc caaaatgaaa ctcccaacag gaatggtcat tttatgtagc 1500
ctaataatag ttcgggcagg gtttgacgac ccccgcaagg ctatcgcatt agtacaaaaa 1560
caacatggta aaccatgcga atgcagcgga gggcaggtat ccgaggcccc accgaactcc 1620
atccaacagg taacttgccc aggcaagacg gcctacttaa tgaccaacca aaaatggaaa 1680
tgcagagtca ctccaaaaat ctcacctagc gggggagaac tccagaactg cccctgtaac 1740
actttccagg actcgatgca cagttcttgt tatactgaat accggcaatg caggcgaatt 1800
aataagacat actacacggc caccttgctt aaaatacggt ctgggagcct caacgaggta 1860
cagatattac aaaaccccaa tcagctccta cagtcccctt gtaggggctc tataaatcag 1920
cccgtttgct ggagtgccac agcccccatc catatctccg atggtggagg acccctcgat 1980
actaagagag tgtggacagt ccaaaaaagg ctagaacaaa ttcataaggc tatgactcct 2040
gaacttcaat accacccctt agccctgccc aaagtcagag atgaccttag ccttgatgca 2100
cggacttttg atatcctgaa taccactttt aggttactcc agatgtccaa ttttagcctt 2160
gcccaagatt gttggctctg tttaaaacta ggtaccccta cccctcttgc gatacccact 2220
ccctctttaa cctactccct agcagactcc ctagcgaatg cctcctgtca gattatacct 2280
cccctcttgg ttcaaccgat gcagttctcc aactcgtcct gtttatcttc ccctttcatt 2340
aacgatacgg aacaaataga cttaggtgca gtcaccttta ctaactgcac ctctgtagcc 2400
aatgtcagta gtcctttatg tgccctaaac gggtcagtct tcctctgtgg aaataacatg 2460
gcatacacct atttacccca aaactggacc agactttgcg tccaagcctc cctcctcccc 2520
gacattgaca tcaacccggg ggatgagcca gtccccattc ctgccattga tcattatata 2580
catagaccta aacgagctgt acagttcatc cctttactag ctggactggg aatcaccgca 2640
gcattcacca ccggagctac aggcctaggt gtctccgtca cccagtatac aaaattatcc 2700
catcagttaa tatctgatgt ccaagtctta tccggtacca tacaagattt acaagaccag 2760
gtagactcgt tagctgaagt agttctccaa aataggaggg gactggacct actaacggca 2820
gaacaaggag gaatttgttt agccttacaa gaaaaatgct gtttttatgc taacaagtca 2880
ggaattgtga gaaacaaaat aagaacccta caagaagaat tacaaaaacg cagggaaagc 2940
ctggcaacca accctctctg gaccgggctg cagggctttc ttccgtacct cctacctctc 3000
ctgggacccc tactcaccct cctactcata ctaaccattg ggccatgcgt tttcagtcgc 3060
ctcatggcct tcattaatga tagacttaat gttgtacatg ccatggtgct ggcccagcaa 3120
taccaagcac tcaaagctga ggaagaagct caggattgag gcgcctagtg ttgacaatta 3180
atcatcggca tagtatacgg catagtataa tacgactcac tataggaggg ccaccatggc 3240
caagttgacc agtgccgttc cggtgctcac cgcgcgcgac gtcgccggag cggtcgagtt 3300
ctggaccgac cggctcgggt tctcccggga cttcgtggag gacgacttcg ccggtgtggt 3360
ccgggacgac gtgaccctgt tcatcagcgc ggtccaggac caggtggtgc cggacaacac 3420
cctggcctgg gtgtgggtgc gcggcctgga cgagctgtac gccgagtggt cggaggtcgt 3480
gtccacgaac ttccgggacg cctccgggcc ggccatgacc gagatcggcg agcagccgtg 3540
ggggcgggag ttcgccctgc gcgacccggc cggcaactgc gtgcacttcg tggccgagga 3600
gcaggactga nnnncggacc ggtcgacttg ttaacttgtt tattgcagct tataatggtt 3660
acaaataaag caatagcatc acaaatttca caaataaagc atttttttca ctgcattcta 3720
gttgtggttt gtccaaactc atcaatgtat cttatcatgt ctggatccag atctgggccc 3780
atgcggccgc ggatcgatnn nnacatgtga gcaaaaggcc agcaaaaggc caggaaccgt 3840
aaaaaggccg cgttgctggc gtttttccat aggctccgcc cccctgacga gcatcacaaa 3900
aatcgacgct caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt 3960
ccccctggaa gctccctcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg 4020
tccgcctttc tcccttcggg aagcgtggcg ctttctcaat gctcacgctg taggtatctc 4080
agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc 4140
gaccgctgcg ccttatccgg taactatcgt cttgagtcca acccggtaag acacgactta 4200
tcgccactgg cagcagccac tggtaacagg attagcagag cgaggtatgt aggcggtgct 4260
acagagttct tgaagtggtg gcctaactac ggctacacta gaaggacagt atttggtatc 4320
tgcgctctgc tgaagccagt taccttcgga aaaagagttg gtagctcttg atccggcaaa 4380
caaaccaccg ctggtagcgg tggttttttt gtttgcaagc agcagattac gcgcagaaaa 4440
aaaggatctc aagaagatcc tttgatcttt tctacggggt ctgacgctca gtggaacgaa 4500



aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt 4560 

ttaaattaaa aatgaagttt taaatcaatc taaagtatat atgagtaaac ttggtctgac 4620 

agttaccaat gcttaatcag tgaggcacct atctcagcga tctgtctatt tcgttcatcc 4680 

atagttgcct gactccccgt cgtgtagata actacgatac gggagggctt accatctggc 4740 

cccagtgctg caatgatacc gcgagaccca cgctcaccgg ctccagattt atcagcaata 4800 

aaccagccag ccggaagggc cgagcgcaga agtggtcctg caactttatc cgcctccatc 4860 

cagtctatta attgttgccg ggaagctaga gtaagtagtt cgccagttaa tagtttgcgc 4920 

aacgttgttg ccattgctac aggcatcgtg gtgtcacgct cgtcgtttgg tatggcttca 4980 

ttcagctccg gttcccaacg atcaaggcga gttacatgat cccccatgtt gtgcaaaaaa 5040 

gcggttagct ccttcggtcc tccgatcgtt gtcagaagta agttggccgc agtgttatca 5100 

ctcatggtta tggcagcact gcataattct cttactgtca tgccatccgt aagatgcttt 5160 

tctgtgactg gtgagtactc aaccaagtca ttctgagaat agtgtatgcg gcgaccgagt 5220 

tgctcttgcc cggcgtcaat acgggataat accgcgccac atagcagaac tttaaaagtg 5280 

ctcatcattg gaaaacgttc ttcggggcga aaactctcaa ggatcttacc gctgttgaga 5340 

tccagttcga tgtaacccac tcgtgcaccc aactgatctt cagcatcttt tactttcacc 5400 

agcgtttctg ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg aataagggcg 5460 

acacggaaat gttgaatact catactcttc ctttttcaat attattgaag catttatcag 5520 

ggttattgtc tcatgagcgg atacatattt gaatgtattt agaaaaataa acaaataggg 5580 

gttccgcgca catttccccg aaaagtgcca cctgacgtct aagaaaccat tattatcatg 5640 

acattaacct ataaaaatag gcgtatcacg aggccctttc gtctcgcgcg tttcggtgat 5700 

gacggtgaaa acctctgaca catgcagctc ccggagacgg tcacagcttg tctgtaagcg 5760 

gatgccggga gcagacaagc ccgtcagggc gcgtcagcgg gtgttggcgg gtgtcggggc 5820 

tggcttaact atgcggcatc agagcagatt gtactgagag tgcac 5865 

<210> 9 
<211> 3925 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Portion of 
construct 

<220> 

<221> misc_feature 
<222> (3910) 

<223> n is any nucleotide 
<220> 

<221> misc_feature 
<222> (3911) 

<223> n is any nucleotide 
<220> 

<221> misc_feature 
<222> (3912) 

<223> n is any nucleotide 
<220> 

<221> misc_feature 
<222> (3913) 

<223> n is any nucleotide 
<400> 9 

agatctcccg atcccctatg gtcgactctc agtacaatct gctctgatgc cgcatagtta 60 
agccagtatc tgctccctgc ttgtgtgttg gaggtcgctg agtagtgcgc gagcaaaatt 120 
taagctacaa caaggcaagg cttgaccgac aattgcatga agaatctgct tagggttagg 180 
cgttttgcgc tgcttcgcga tgtacgggcc agatatacgc gttgacattg attattgact 240 
agttattaat agtaatcaat tacggggtca ttagttcata gcccatatat ggagttccgc 300 
gttacataac ttacggtaaa tggcccgcct ggctgaccgc ccaacgaccc ccgcccattg 360 
acgtcaataa tgacgtatgt tcccatagta acgccaatag ggactttcca ttgacgtcaa 420 
tgggtggact atttacggta aactgcccac ttggcagtac atcaagtgta tcatatgcca 480 
agtacgcccc ctattgacgt caatgacggt aaatggcccg cctggcatta tgcccagtac 540 
atgaccttat gggactttcc tacttggcag tacatctacg tattagtcat cgctattacc 600 
atggtgatgc ggttttggca gtacatcaat gggcgtggat agcggtttga ctcacgggga 660 
tttccaagtc tccaccccat tgacgtcaat gggagtttgt tttggcacca aaatcaacgg 720 
gactttccaa aatgtcgtaa caactccgcc ccattgacgc aaatgggcgg taggcgtgta 780 




cggtgggagg tctatataag cagagctctc tggctaacta gagaacccac tgcttaactg 840 
gcttatcgaa atgtcgactg agaacttcag ggtgagtttg gggacccttg attgttcttt 900 
ctttttcgct attgtaaaat tcatgttata tggagggggc aaagttttca gggtgttgtt 960 
tagaatggga agatgtccct tgtatcacca tggaccctca tgataatttt gtttctttca 1020 
ctttctactc tgttgacaac cattgtctcc tcttattttc ttttcatttt ctgtaacttt 1080 
ttcgttaaac tttagcttgc atttgtaacg aatttttaaa ttcacttttg tttatttgtc 1140 
agattgtaag tactttctct aatcactttt ttttcaaggc aatcagggta tattatattg 1200 
tacttcagca cagttttaga gaacaattgt tataattaaa tgataaggta gaatatttct 1260 
gcatataaat tctggctggc gtggaaatat tcttattggt agaaacaact acatcctggt 1320 
catcatcctg cctttctctt tatggttaca atgatataca ctgtttgaga tgaggataaa 1380 
atactctgag tccaaaccgg gcccctctgc taaccatgtt catgccttct tctttttcct 1440 
acagctcctg ggcaacgtgc tggttgttgt gctgtctcat cattttggca aggatcggcc 1500 
ggaacagcat caggaccgac atggaaggtc cagcgttctc aaaacccctt aaagataaga 1560 
ttaacccgtg gaagtcctta atggtcatgg gggtctattt aagagtaggg atggcagaga 1620 
gcccccatca ggtctttaat gtaacctgga gagtcaccaa cctgatgact gggcgtaccg 1680 
ccaatgccac ctccctttta ggaactgtac aagatgcctt cccaagatta tattttgatc 1740 
tatgtgatct ggtcggagaa gagtgggacc cttcagacca ggaaccatat gtcgggtatg 1800 
gctgcaaata ccccggaggg agaaagcgga cccggacttt tgacttttac gtgtgccctg 1860 
ggcataccgt aaaatcgggg tgtggggggc caagagaggg ctactgtggt gaatggggtt 1920 
gtgaaaccac cggacaggct tactggaagc ccacatcatc atgggaccta atctccctta 1980 
agcgcggtaa caccccctgg gacacgggat gctccaaaat ggcttgtggc ccctgctacg 2040 
acctctccaa agtatccaat tccttccaag gggctactcg agggggcaga tgcaaccctc 2100 
tagtcctaga attcactgat gcaggaaaaa aggctaattg ggacgggccc aaatcgtggg 2160 
gactgagact gtaccggaca ggaacagatc ctattaccat gttctccctg acccgccagg 2220 
tcctcaatat agggccccgc atccccattg ggcctaatcc cgtgatcact ggtcaactac 2280 
ccccctcccg acccgtgcag atcaggctcc ccaggcctcc tcagcctcct cctacaggcg 2340 
cagcctctat agtccctgag actgccccac cttctcaaca acctgggacg ggagacaggc 2400 
tgctaaacct ggtagaagga gcctatcagg cgcttaacct caccaatccc gacaagaccc 2460 
aagaatgttg gctgtgctta gtgtcgggac ctccttatta cgaaggagta gcggtcgtgg 2520 
gcacttatac caatcattct accgccccgg ccagctgtac ggccacttcc caacataagc 2580 
ttaccctatc tgaagtgaca ggacagggcc tatgcatggg agcactacct aaaactcacc 2640 
aggccttatg taacaccacc caaagtgccg gctcaggatc ctactacctt gcagcacccg 2700 
ctggaacaat gtgggcttgt agcactggat tgactccctg cttgtccacc acgatgctca 2760 
atctaaccac agactattgt gtattagttg agctctggcc cagaataatt taccactccc 2820 
ccgattatat gtatggtcag cttgaacagc gtaccaaata taagagggag ccagtatcgt 2880 
tgaccctggc ccttctgcta ggaggattaa ccatgggagg gattgcagct ggaataggga 2940 
cggggaccac tgccctaatc aaaacccagc agtttgagca gcttcacgcc gctatccaga 3000 
cagacctcaa cgaagtcgaa aaatcaatta ccaacctaga aaagtcactg acctcgttgt 3060 
ctgaagtagt cctacagaac cgaagaggcc tagatttgct cttcctaaaa gagggaggtc 3120 
tctgcgcagc cctaaaagaa gaatgttgtt tttatgcaga ccacacggga ctagtgagag 3180 
acagcatggc caaactaagg gaaaggctta atcagagaca aaaactattt gagtcaggcc 3240 
aaggttggtt cgaagggcag tttaatagat ccccctggtt taccacctta atctccacca 3300 
tcatgggacc tctaatagta ctcttactga tcttactctt tggaccctgc attctcaatc 3360 
gattagttca atttgttaaa gacaggatct cagtagtcca ggctttagtc ctgactcaac 3420 
aataccacca gctaaagcct atagagtacg agccataggg cgcctagtgt tgacaattaa 3480 
tcatcggcat agtatacggc atagtataat acgactcact ataggagggc caccatggcc 3540 
aagttgacca gtgccgttcc ggtgctcacc gcgcgcgacg tcgccggagc ggtcgagttc 3600 
tggaccgacc ggctcgggtt ctcccgggac ttcgtggagg acgacttcgc cggtgtggtc 3660 
cgggacgacg tgaccctgtt catcagcgcg gtccaggacc aggtggtgcc ggacaacacc 3720 
ctggcctggg tgtgggtgcg cggcctggac gagctgtacg ccgagtggtc ggaggtcgtg 3780 
tccacgaact tccgggacgc ctccgggccg gccatgaccg agatcggcga gcagccgtgg 3840 
gggcgggagt tcgccctgcg cgacccggcc ggcaactgcg tgcacttcgt ggccgaggag 3900 
caggactgan nnncggaccg gtcga 3925 

<210> 10 
<211> 58 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide 



<400> 10 

cggaattcgg atccgagctc ggcccagccg gccaccatga aaacatttaa catttctc 58 



<210> 11 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide 

<400> 11 

gatccatcga taagcttggt ggtaaaactt tt 32 

<210> 12 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide 

<400> 12 

gctcttcgga ccctgcattc 20 

<210> 13 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide 

<400> 13 

tagcatggcg ccctatggct cgtactctat aggc 34 

<210> 14 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 14 

cgcctcatgg ccttcattaa 20 

<210> 15 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 15 

tagcatggcg cctcaatcct gagcttcttc c 31 

<210> 16 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<400> 16 

tctcgcttct gttcgcgcgc 20 

<210> 17 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 17 

tcgatcaagc ttgcggccgc ggtggtgggt cggtggtcc 39 

<210> 18 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 18 

ctctggctca cagtacgacg tag 23 

<210> 19 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 19 

ccatcaatcc ggtaggtttt ccg 23 

<210> 20 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 20 

carrgkttca araacwsycc cac 23 

<210> 21 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<220> 

<221> misc_feature 
<222> (12) 

<223> n is any nucleotide 
<400> 21 

agyarvgtag cngggtthag g 21 

<210> 22 
<211> 26 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 22 

tccccttgga atactcctgt tttygt 26 

<210> 23 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 23 

cattccttgt ggtaaaactt tccaytg 27 

<210> 24 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 24 

cctcaccctg atcacryttg 20 

<210> 25 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 25 

gaattatgtc tgacagaagg g 21 

<210> 26 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 26 

gttgacatct gcagagaaag acc 23 

<210> 27 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 27 

tctgaggtct gtacacacaa tgg 23 

<210> 28 
<211> 167 
<212> DNA 

<213> Artificial Sequence 






<220> 

<223> Description of Artificial Sequence: Portion of 
construct 

<400> 28 

tctagactga catggcgcgt tcaacgctct caaaacccct taaaaataag gttaacccgc 60 
gaggccccct aatcccctta attcttctga tgctcagagg ggtcagtact gcttcgcccg 120 
gctccagtgc ggcccagccg gccaccatga aaacatttaa catttct 167 

<210> 29 
<211> 103 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Portion of 
construct 

<400> 29 

tacgagccat agggcgccta gtgttgacaa ttaatcatcg gcatagtata cggcatagta 60 
taatacgact cactatagga gggccaccat ggccaagttg acc 103 






